Storage device with enclosure power control

ABSTRACT

The present invention makes is possible to appropriately set the power saving control of the storage device from the management device of the storage device. In addition, the storage device executes control of the power saving for the magnetic disk device after ensuring consistency between an instruction from the administrator and the operating state of the magnetic disk device in the storage device.

CROSS-REFERENCE TO PRIOR APPLICATION

This application relates to and claims the benefit of priority fromJapanese Patent Application number 2007-48331, filed on Feb. 28, 2007,and Japanese Patent Application number 2007-261511, filed on Oct. 5,2007 and is a continuation application of U.S. application Ser. No.11/968,345, filed Jan. 2, 2008, now U.S. Pat. No. 7,904,651, the entiredisclosures of which are incorporated herein by reference.

BACKGROUND

The present invention generally relates to a storage device.

The method generally used to control a magnetic disk device involvesturning the power ON when access is received, executing rotation of themagnetic storage medium, responding to access by a host computer, and,after access is complete, turning OFF the power of the motor used torotate the magnetic storage medium in a stepwise fashion. That is,magnetic disk devices contain various power saving modes and mainlypossess a function to select a predetermined power saving mode whenaccess is being received and to automatically shift to this mode.

A storage device comprises a storage unit in which a plurality ofmagnetic disk devices are connected in the form of an array and providesa host computer such as a server with logical storage areas (‘LU’hereinbelow) comprising a plurality of magnetic disk devices. In orderto increase reliability and so forth, the storage device provides thehost computer with RAID (Redundant Array of Independent Disks)-basedredundant LU. In such a storage device, in cases where there is verylittle access by the host computer, there is no need for all themagnetic disk devices to be operating. However, when a magnetic diskdevice temporarily stops rotating, same is unable to immediately respondto access by the host computer. Usage of the above control method thatis generally employed for magnetic disk devices means that it takes timefor the magnetic disk devices to operate in the event of an accessrequest and the overall performance of the storage device dropsconsiderably.

Therefore, in the case of the technology that appears in JapaneseApplication Laid Open No. 2000-293314, a technology which manages thepositions of the magnetic disk devices that constitute the LU as well asthe state of the access by the host computer to the logical volumes andwhich, in cases where there is no access to a logical volume in amagnetic disk device for a predetermined time (referred to as ‘period A’hereinbelow), selects one of a plurality of power saving modes andshifts the magnetic disk device to the selected power saving mode hasbeen disclosed (See Japanese Application Laid Open No. 2000-293314).

According to the technology that appears in Japanese Application LaidOpen No. 2000-293314, in cases where the storage device provides anapplication for which access occurs intermittently with an LU, forexample, a need sometimes arises to set period A which is the period inwhich it is judged that there is no access by the application to severalhours depending on the access cycle of the application.

However, even in the case of such an application for which access occursintermittently, the operation is sometimes such that access to thestorage device does not occur for a long period as is the case during ascheduled stoppage at night. In the case of such an operation, amagnetic disk device is desirably shifted to a variety of power savingmodes at the moment when the scheduled stoppage takes place in order toreduce power consumption.

However, with the technology that appears in Japanese Application LaidOpen No. 2000-293314, unless a number of hours have elapsed since thestorage device is accessed, the processing to shift the magnetic diskdevice to a variety of power saving modes is not executed. That is,excess power is consumed by a magnetic disk device which is known tohave not been accessed by an application for several hours.

SUMMARY

Therefore, the present invention provides a storage device which allowspower saving control for magnetic disk devices to be appropriately setby a management device of the storage device.

Meanwhile, the LU which the storage device provides for the applicationis arranged distributed between a plurality of magnetic disk devices.Furthermore, the storage device is often shared by a plurality ofapplications or a plurality of administrators. Hence, because the powersaving control of the magnetic disk devices is performed in accordancewith instructions from each of the administrators based on thecircumstances of the operation of each application, there are sometimesinconsistencies with the operating states of other applications and theinstructions of the other administrators. That is, in cases where anLU-A used by an application A during a scheduled stoppage and an LU-Bwhich is used by an application B during operation are disposed in thesame magnetic disk device, the storage device shifts to a magnetic diskdevice power saving mode which corresponds with the LU-A in accordancewith an instruction from the administrator of application A and isunable to respond to access to the LU-B by application B.

Therefore, the present invention provides a method that executes powersaving control after ensuring consistency between instructions fromadministrators and the operating states of the magnetic disk devices inthe storage device.

In addition, even when consistency between an instruction from anadministrator and the operating state of a magnetic disk device in thestorage device has not been ensured, consistency is sometimes ensuredsubsequently as a result of changes in the operating states of each ofthe applications. In other words, even when application B is operatingat the moment when there is an instruction to shift the magnetic diskdevice to a power saving mode from the administrator of application Aduring a scheduled stoppage, application B will subsequently sometimesno longer operate. Therefore, the present invention provides a methodaccording to which, even when consistency has not been ensured betweenan instruction from the administrator and the operating state of amagnetic disk device in the storage device, the storage device receivethe instruction from the administrator, monitors the usage states of theother LU until consistency has been ensured, waits until consistency hasbeen ensured, and executes control to reduce the power consumption ofthe magnetic disk device at the point where the operation with respectto the magnetic disk device by application B has not been performed fora fixed period.

The storage device of the present invention comprises a plurality ofmagnetic disk devices; a power supply unit which supplies power to themagnetic disk devices; a rotational state control unit which transmits asignal for controlling the rotational state of motors of the magneticdisk devices to the magnetic disk devices; and a disk control unit whichprovides a host computer with a logical storage area comprising storageareas of the plurality of magnetic disk devices. The storage devicereceives an instruction to shift the rotational state of the magneticdisk device from the management device for each LU which constitutes aunit for access by the host computer.

The storage device comprises access information which indicates, foreach of the logical storage areas, access by the host computer andmanagement information which indicates the arrangement of the LU in themagnetic disk device. In cases where there is an instruction from themanagement device to shift the rotational state of the magnetic diskdevice and in cases where access with respect to all the LU in themagnetic disk device which corresponds with the LU for which theinstruction is issued has not occurred for a predetermined time, controlis executed to shift the rotational state of the magnetic disk devicewhich corresponds with the LU for which the instruction is issued.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a first configuration view of a storage system;

FIG. 2 shows a local memory;

FIG. 3 shows LU management information;

FIG. 4 shows RAID group configuration management information;

FIG. 5 shows the process flow of a control program;

FIG. 6 is a conceptual view of the storage area of a magnetic disk;

FIG. 7 shows information on whether disks can be turned OFF;

FIG. 8 shows the process flow of a judgment program;

FIG. 9 shows a second configuration view of the storage system;

FIG. 10 shows enclosure management information;

FIG. 11 shows the process flow for power OFF in enclosure units by thecontrol program;

FIG. 12 is a conceptual view of the processing of the whole system; and

FIG. 13 shows a GUI.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

An embodiment of the present invention will be described hereinbelowwith reference to the drawings. FIG. 1 is a configuration view of astorage system 100 according to this embodiment.

The storage system 100 comprises a host computer 101, a managementdevice 102, and a storage device 103. The storage device 103 comprises aplurality of magnetic disk devices 104, a power supply unit 105 whichsupplies power to the magnetic disk devices 104, a power supply controlcircuit 106 which performs power supply switching, and a disk controldevice 107. The host computer 101 and management device 102 areconnected to the plurality of magnetic disk devices 104 via the diskcontrol device 107.

The host computer 101 is a computer that comprises informationprocessing resources such as a CPU (Central Processing Unit) and memory,for example, and is specifically constituted by a personal computer, aworkstation, or a mainframe or the like. The host computer 101 setslogical storage areas (‘LU’ hereinbelow) which are provided by the diskcontrol device 107 as targets and transmits data I/O request commands tothe disk control device 107. The disk control device 107 provides thehost computer 101 with storage areas which are arranged distributedbetween a plurality of magnetic disk devices 104 as one LU. The diskcontrol device 107 provides the host computer 101 with a plurality ofLU. The data which are stored in the magnetic disk device 104 areafforded RAID (Redundant Arrays of Inexpensive Disks)-based redundancy.A set of a plurality of magnetic disk devices 104 with a RAIDconfiguration is known as a RAID group. The respective LUs are arrangedacross storage areas of the magnetic disk devices in the RAID group.Furthermore, the storage device has a plurality of RAID groups.

The management device 102 is connected to the storage device via aninterface 108 and transmits instructions to the storage device 103 tochange the settings of the configuration information of the storagedevice 103 such as, for example, to change the settings of the RAIDgroups or change the settings of the LU management information(described subsequently).

The disk control device 107 is constituted by two controllers 1071A and1071B and an inter-controller connection path 109 which connects thecontrollers 1071A and 1071B in a communicable state. By making thecontrollers 1071A and 1071B redundant, when a fault has occurred in onecontroller, the other controller can be operated. In addition, in caseswhere the load is biased toward one controller, the load can bedistributed by operating the other controller.

The respective controllers 1071A and 1071B control the reading andwriting of data to and from the magnetic disk devices 104 in accordancewith requests of the host computer 101 and comprise host interfaces1072A and 1072B, data transfer control units 1073A and 1073B, cachememory units 1074A and 1074B, bridges 1075A and 1075B, local memoryunits 1076A and 1076B, processors 1077A and 1077B, disk interfaces 1078Aand 1078B, and so forth. The respective elements in the controller areconnected by a data transfer line 110 and a control command transferline 111.

Of these, the host interfaces 1072A and 1072B are interfaces whichperform communication control with respect to the host computer 101.

The data transfer control units 1073A and 1073B possess a function fordata transfers between the two controllers 1071A and 1071B and for datatransfers between the respective elements in the controllers 1071A and1071B.

The bridges 1075A and 1075B are relay devices which connect therespective processors 1077A and 1077B and the respective local memoryunits 1076A and 1076B to the respective data transfer control units1073A and 1073B in their own systems. Here, ‘their own systems’ refersto the respective constituent elements in the same controller. However,the respective constituent elements in the other controller will also becalled the ‘other system.’

The local memory units 1076A and 1076B are used in order to temporarilyhold the various commands such as read commands or write commands whichare supplied by the host computer 101. The processors 1077A and 1077Bprocess the read commands and write commands which are stored in thelocal memory units 1076A and 1076B. In addition, as will be describedsubsequently, the local memory units 1076A and 1076B have variouscontrol programs and these programs are executed and processed by theprocessors 1077A and 1077B.

The processors 1077A and 1077B each have a function that governs controlof the overall operation of the controllers 1071A and 1071B of theirrespective own systems. The processors 1077A and 1077B performprocessing to read and write data to and from the magnetic disk devices104 in accordance with write commands and read commands for LU which arestored in the local memory units 1076A and 1076B. The respectiveprocessors 1077A and 1077B perform processing to read and write data toand from those LU which have been exclusively assigned to their ownprocessor beforehand (called ‘associated LU’ hereinbelow).

The assignment of the associated LU with respect to the respectiveprocessors 1077A and 1077B can be dynamically changed in accordance withthe load states of the respective processors 1077A and 1077B, thereceipt of an associated processor designation command which designatesan associated processor for each LU and which is supplied by the hostcomputer 101, or an instruction from the management device 102.Furthermore, the assignment of an associated LU to each of theprocessors 1077A and 10778 can also be dynamically changed as a resultof the existence of a fault occurrence in the connection path betweenthe disk control device 107 and the host computer 101 or in theconnection path between the disk control device 107 and magnetic diskdevice 104.

In addition, the local memory units 1076A and 1076B store addressmapping information 205 which indicates the relationship between therespective processors and the associated LU as well as addressinformation on the local memory units of their own system and the othersystem and, upon receipt of a command transmitted by the host computer,judge, based on the address mapping information 205, whether the targetLU is associated with the processor of their own system or the othersystem and, when the LU is associated with the processor of the othersystem, may apply control to transfer the command to the local memoryunit of the other system on the basis of the address mapping information205.

Although a redundant configuration in which two of each of therespective elements in the disk control device 107 are provided isillustrated in this embodiment, the configuration may also have one ofeach of these elements.

The magnetic disk devices 104 are able to assume each of threerotational states, namely, normal mode, power conservation mode, andmotor stoppage mode in accordance with an instruction from the diskcontrol device 107.

Normal mode is a state where the motor of the magnetic disk devicerotates to an extent permitting the magnetic disk device to respond toaccess with respect to the storage areas of the magnetic disk devices104 by the host computer 101.

Power conservation mode is a state where the magnetic disk devices 104are in a standby state with a lower (slower) motor speed than that usedin normal mode. In power conservation mode, the magnetic disk device isable to deal with access by the host computer 101 more rapidly than in astate where the motor is stopped, whereby the effect on performance canbe reduced.

Motor stoppage mode is a standby state where the motors of the magneticdisk devices are stopped while power is supplied to the magnetic diskdevices 104 in order to effect a greater reduction in power consumptionthan for magnetic disk devices 104 which are in power conservation mode.

By shifting the rotational state, i.e. reducing the rotational speed ofthe motors of the magnetic disk devices, the amount of power consumed bythe magnetic disk devices can be reduced. In addition, heat is generatedas a result of the power consumption of the magnetic disk devices andthe rotation of the motors thereof. In order to avoid an increase in thetemperature within the storage system caused by heat emission, theinterior of the control device is cooled by means of a cooling devicesuch as a fan or air conditioner, for example. By reducing therotational speed of the motors of the magnetic disk devices or stoppingthe rotation of the motors, the heat emission within the storage device103 can be reduced and the amount of power consumed by the coolingdevice can be reduced. However, access by the host computer to the LUdisposed in the magnetic disk devices occurs arbitrarily. That is, inorder to reduce the power consumption of the whole storage device 103,the magnetic disk devices must enter an optimum mode in accordance withthe state of access to the LU disposed in the respective magnetic diskdevices 104.

The magnetic disk devices 104 each receive an instruction via thecontrol command transfer line 111 from the processors 1077A and 1077Band shift the rotational state.

FIG. 2 shows the information that the local memory units 1076A and 10768hold. The local memory units store a disk control program 201 forperforming power saving control, LU management information 202 formanaging the access states and configuration information of each LU,RAID group management information 203 for managing the RAID groupconfiguration information, and address mapping information 205. Althoughan example in which the various control programs are stored in localmemory units is shown in this embodiment, the various control programsmay also be stored in the system area of the storage areas of themagnetic disk devices.

FIG. 3 shows LU management information 202. LU management information202 manages the states of access by the host computer 101 for each LU aswell as configuration information for each LU. With the LU managementinformation 202, setting changes are carried out in accordance withinstructions from the management device 102. The LUN (Logical UnitNumber) 301 shows the identification numbers assigned to each LU. Thearray configuration 302 indicates the RAID configuration of the RAIDgroup to which the LU belong. The disk number 303 indicates theidentification numbers of the magnetic disk devices 104 that constitutethe RAID group which corresponds with the LU.

The disk startup state 304 indicates the actual states of the motors ofthe magnetic disk devices 104 comprising the LU. In cases where the diskstartup state is shown as ‘OFF’, this constitutes a state where themotors of the magnetic disk devices 104 corresponding with the LU arestopped and it is not possible to respond immediately to access by thehost computer 101 (because it takes time for the motors of the magneticdisk devices to rotate, access takes time). In cases where the diskstate is shown as ‘ON’, this indicates a state where a magnetic diskdevice corresponding with the LU is able to immediately deal with accessby the host computer 101.

The disk state 305 shows the presence or absence of faults with themagnetic disk devices which correspond with the LU and, in cases where afault is sensed by the disk control program 201, ‘anomaly’ is shown.

The state information 306 is a mode for a magnetic disk device for whichthere has been an instruction from the management device 102 withrespect to the LU. State information 306 is ‘normal’ in an initial stateprior to receiving an instruction from the management device 103.

In cases where state information 306 is ‘normal’, this indicates thatthere has been an instruction from the management device 102 to set themagnetic disk devices 104 which correspond with the LU to an accessiblestate, that is, to normal mode. In cases where state information 306 isa ‘power conservation instruction’, this indicates that there has beenan instruction from the management device 102 to reduce the speed of themotors of the magnetic disk devices 104 which correspond with the LU,that is, to set the magnetic disk devices 104 to power conservationmode. For example, in cases where it is judged that there has been noaccess for a fixed time, the system administrator issues an instructionto reduce the speed of the motors of the magnetic disk devices 104 whichcorrespond with the LU via the management device 102. In cases wherestate information 306 is ‘power OFF instruction’, this indicates thatthere is an instruction from the system administrator via the managementdevice 102 to stop the motors of the magnetic disk devices 104 whichcorrespond with the LU, that is, to place the magnetic disk devices 104in motor stoppage mode.

State information 306 indicates the instruction state from themanagement device 102. The processing to shift the rotational state ofthe actual magnetic disk device 104 is executed by the disk controlprogram 201 after the state information 306 has been changed uponreceipt of an instruction from the management device 102. That is, thisdoes not mean that the content represented by the state information 306indicates the state of the actual magnetic disk device 104 but insteadindicates content with regard to the instruction from the managementdevice 102 with respect to what kind of rotational states the magneticdisk devices which correspond with the LU should be placed in, inresponse to access to the LU. The processing of the disk control program201 will be described subsequently.

The elapsed time 307 stores the elapsed time from the last access to theLU by the host computer 101 until the current time.

So that the LU management information 202 and RAID group managementinformation 203 stored in the two local memory units 1076A and 10768 arealways the same, when the information in the first local memory unit1076A is updated, the information in the other local memory unit 10768is updated in the same fashion. The processing of this information isexecuted by the disk control program 201.

FIG. 4 shows RAID group management information 203 which shows therelationship between the RAID groups and LU. The RAID group managementinformation is set and updated by the system administrator via themanagement device 102. The RG number 401 indicates the identificationnumber of the RAID group and the LUN 402 indicates the identificationnumbers of the LU contained in the RAID groups.

FIG. 5 shows the flow of processing by the disk control program 201. Thedisk control program 201 is executed by the processors 1077A and 10778.The processing by the disk control program 201 starts when the diskcontrol program 201 senses that the system administrator has issued aninstruction to shift the rotational state of the magnetic disk devices104 which correspond with the LU for each LU via the management device102.

The state information 306 of the LU management information 202 ismanaged for each LU but the management device 102 is able to grasp theRAID configuration of the storage device 103. Hence, an instruction tochange the state information 306 from the management device 102 may bemade for each RAID group, for each LU, or for every plurality of LU.

The disk control program 201 controls the rotational state of themagnetic disk devices 104 based on the instruction from the managementdevice 102 and the status of access by the host computer 101. The accessby the host computer 101 is performed in LU units. As will be describedsubsequently, by regularly confirming the elapsed time 307 of the LUmanagement information 202 indicating the state of access by the hostcomputer 101 for each LU, the disk control program 201 is able tocontrol the rotational state of the magnetic disk devices 104 withpinpoint precision in accordance with the states of access by the hostcomputer 101.

An example of the processing that is performed by the disk controlprogram 201 in cases where the disk control program 201 senses that thestorage device has received an instruction from the management device102 to shift the rotational state of the magnetic disk devices 104 whichcorrespond with the LU for each LU will now be described using FIG. 5.

When an instruction to shift the rotational state of the magnetic diskdevices 104 is received from the management device 102, the disk controlprogram 201 first judges whether the state information may be changed inaccordance with the instruction from the management device 102 bychecking same against the state of access by the host computer 101 tothe LU for which the instruction is issued (501). The state of access bythe host computer 101 to the LU for which the instruction is issued isconfirmed by referencing the elapsed time 307. For example, in caseswhere there has been a ‘power conservation instruction’ or a ‘power OFFinstruction’ from the management device 102 with respect to an LU forwhich access by the host computer 101 is continuous, the control program201 judges that the state information is not to be changed.

This is because a change in the rotational state of magnetic diskdevices 104 comprising the LU for which access is continuous wouldengender a fatal fault. In cases where there is an inconsistency betweenthe state of access to the magnetic disk devices 104 and the instructionfrom the management device 102, the disk control program 201 reports anerror to the management device 102 and ends the processing (502).

In addition, even for an LU that is not being accessed by the hostcomputer 101, in cases where there is a ‘power OFF instruction’ from themanagement device (102) for an LU for which state information 306 is‘normal’, the disk control program 201 advances the processing bychanging the ‘power OFF instruction’ to ‘power conservationinstruction’. An LU for which the state information 306 is ‘normal’ hasa high probability of receiving access immediately from the hostcomputer 101. Accordingly, by changing the instruction from themanagement device 102 from a ‘power OFF instruction’ to a ‘powerconservation instruction’, a magnetic disk device is made to standbywith the motor of the magnetic disk device still rotating at a low speedin order to be able to rapidly deal with access by the host computer101. Thereafter, the disk control program 201 changes the stateinformation 306 of the LU for which the instruction is issued to a‘power conservation instruction’.

In the other cases above, the disk control program 201 changes the stateinformation 306 of the LU for which the instruction is issued inaccordance with an instruction from the management device 102 (503).

In cases where the state information 306 of the LU for which aninstruction is issued is changed to ‘normal’ by the disk control program201 (505), the disk number (303) of the LU management information 202 isreferenced, the magnetic disk devices 104 which correspond with the LUfor which the instruction is issued are confirmed, and an instruction isissued to the magnetic disk devices 104 via the control command transferline 111 to place the rotational state of the magnetic disk devices 104in the normal mode of being able to respond to access by the hostcomputer 101 (506).

In cases where the state information 306 is changed to the ‘powerconservation instruction’ by the disk control program 201 (507), theelapsed time 307 of the LU management information 202 is referenced, andit is confirmed whether the LU for which the instruction is issued hasnot been accessed for a predetermined time (508). In cases where the LUhas not been accessed for a predetermined time, that is, in cases wherethe elapsed time 307 of the LU for which the instruction is issuedexceeds a predetermined time, the state information 306 is also changedfrom a ‘power conservation instruction’ to a ‘power OFF instruction’(509). Predetermined time (called ‘predetermined time A’ hereinbelow) asit is meant here is a time which serves as a reference for judging thataccess to the LU by the host computer has not occurred and can beappropriately set by the management device. In addition to cases ofcontrolling a shift in the rotational state of the magnetic disk devices104 which is carried out in the event of an instruction to change thestate information from the management device 102 as per this embodiment,there are also cases where a shift in the rotational state of magneticdisk devices 104 that have not been accessed for a predetermined time(called ‘predetermined time B’ hereinbelow) in the judgment of only thedisk control device 107 of the storage device 103 is controlled.Predetermined time A can be set to a shorter time in comparison withpredetermined time B. This is because the system administrator or thelike who is aware of the operating state of the host computer issues aninstruction to the storage device 103 via the management device 102 andthe time required until it is judged that there has been no access isshorter than predetermined time B which is for a case where a judgmentis made only by the storage device 103.

In cases where a predetermined time A has not elapsed after access islast made to the LU for which the instruction is issued, that is, incases where the elapsed time 307 of the LU for which the instruction isissued does not exceed a predetermined time A, the processing isadvanced with the state information remaining as ‘power conservationinstruction’.

In cases where the state information 306 of the LU for which theinstruction is issued remains as ‘power conservation instruction’, thedisk control program 201 then confirms the state information 306 of theother LU belonging to the same RAID group as the LU for which theinstruction is issued (510). Thereafter, in cases where there is an LUfor which the state information 306 is ‘normal’ in the same RAID group,the disk control program 201 keeps the magnetic disk devices whichcorrespond with the RAID group in normal mode and repeatedly confirms,at fixed intervals, the state information 306 of the LU which belong tothe same RAID group as the LU for which the instruction is issued. Incases where the state information 306 of the LU which belong to the sameRAID group as the LU for which the instruction is issued is all either‘power conservation instruction’ or ‘power OFF instruction’, the diskcontrol program 201 issues an instruction to the magnetic disk devices104 via the control command transfer line 111 so that all of themagnetic disk devices which correspond with the RAID group are placed inpower conservation mode (511).

Thereafter, the disk control program 201 references the elapsed time 308of the LU which belongs to the same RAID group as the LU for which theinstruction is issued and confirms whether access by the host computer101 has not occurred for predetermined time A with respect to all of theLU which belong to the RAID group (512). In cases where access by thehost computer 101 has not occurred for the predetermined time for all ofthe LU, that is, in cases where the elapsed time 307 of all the LU whichbelong to the RAID group exceeds predetermined time A, the disk controlprogram 201 changes the state information 306 of all the LU which belongto the RAID group from ‘power conservation instruction’ to ‘power OFFinstruction’ (513).

When the rotational state of the motors of the magnetic disk devices 104are shifted, all of the storage areas of the magnetic disk devices 104are affected. Hence, not only the access states with respect to the LUfor which the instruction is issued but also the access states withrespect to the other LU in the magnetic disk devices 104 whichcorrespond with the LU for which the instruction is issued must beconsidered in the shifting of the rotational state of the magnetic diskdevices 104. That is, in cases where the LU is a storage area thatextends across a plurality of magnetic disk devices 104 in the RAIDgroup, in order to shift the rotational state of the magnetic diskdevices, the disk control program 201 must confirm that not only thestate information 306 of the LU for which the instruction is issued butalso the state information 306 of the other LU which belong to the sameRAID group as the LU for which the instruction is issued is either‘power conservation instruction’ or ‘power OFF instruction’.

In cases where the state information 306 is changed to ‘power OFFinstruction’ as a result of the processing of the disk control program201 (514, 509, and 513), the disk control program 201 references the LUmanagement information 202 and confirms that the state information 306of the other LU in the same RAID group as the LU whose state information306 has been changed to ‘power OFF instruction’ are all ‘power OFFinstruction’ (515). In cases where there is an LU whose stateinformation 306 is either ‘normal’ or ‘power conservation instruction’,the disk control program 201 repeatedly performs the aforementionedconfirmation at fixed intervals.

In cases where the state information 306 of the other LU in the sameRAID group as the LU whose state information 306 is changed to ‘powerOFF instruction’ are all ‘power OFF instruction’, the disk controlprogram 201 issues an instruction to a disk judgment program 603 whichis stored in the system area of the magnetic disk devices to judgewhether the motors of the magnetic disk devices 104 which correspondwith the RAID group may be stopped (516). The processing of the diskjudgment program 603 will be described subsequently. The disk controlprogram 201 then receives the judgment result from the disk judgmentprogram 603 (517). In cases where the judgment result is that the motorsof the magnetic disk devices 104 which correspond with the RAID groupcan be stopped (518), the disk control program 201 stops the motors ofthe magnetic disk devices which correspond with the RAID group (521).

In addition, in cases where the judgment result of the disk judgmentprogram 603 is the judgment that the motors of the magnetic disk devicescannot be stopped owing to the hardware of the magnetic disk devices orthe like (519), (518, 519), the disk control program 201 issues an errorreport to the management device 102 (520). The disk control program 201also changes the disk state 305 of the LU management information 202 to‘anomalous’.

As is described hereinabove, the LU sometimes extends across the storagearea of a plurality of magnetic disk devices which belong to the RAIDgroup and, therefore, stoppage of the rotation of the magnetic diskdevices must be performed in RAID group units.

The processing in which the disk judgment program 603 judges whether themotor of the respective magnetic disk devices may be stopped will beillustrated hereinbelow by using FIGS. 6, 7, and 8.

FIG. 6 shows a conceptual view of the storage area of the magnetic diskdevices 104. The magnetic disk devices 104 are each assigned a number.The storage area of the magnetic disk devices 104 includes a system area601 in which configuration information and control information of thestorage device 103 as well as information that is managed by the systemadministrator are stored and a user area 602 for storing datatransferred from the host computer. The system area 601 stores diskjudgment program 603 and a disk ON/OFF feasibility information 604. Thedisk judgment program 603 is executed by a processor which the magneticdisk devices 104 comprise.

FIG. 7 shows the disk ON/OFF feasibility information 604. The diskON/OFF feasibility information 604 associates and represents informationidentifying the magnetic disk devices 104 and information on whether themotors of the magnetic disk devices 104 may be stopped. Disk number 701shows the identification numbers of the magnetic disk devices 104 andRAID group number 702 shows the identification number of the RAID groupto which the magnetic disk devices 104 belong. Motor stoppagefeasibility 703 shows whether the rotation of the motors of the magneticdisk devices may be in a stopped state as a result of the judgment bythe disk judgment program 603 and ‘unfeasible’ is shown until aninstruction is received from the disk control program 201.

FIG. 8 shows the process flow of the disk judgment program 603. Theprocessing of the disk judgment program 603 starts in the event that thedisk judgment program 603 senses an instruction to start the judgmentprocessing from the disk control program 201 (514). At the moment whenthe processing of the disk judgment program 603 starts, the motorstoppage feasibility 703 of the disk ON/OFF feasibility information 604of the magnetic disk devices for which a judgment instruction is issuedis set to unfeasible. The disk judgment program 603 judges whether themotor stoppage feasibility 703 which corresponds with the magnetic diskdevices for which the judgment instruction is issued by the disk controlprogram 201 is feasible or not (801). The processing is terminated and areport to that effect is sent to the disk control program 201 in caseswhere motor stoppage feasibility 703 is possible. However, in caseswhere the motor stoppage feasibility 703 is unfeasible, the diskjudgment program 603 judges that, in cases where data which have notbeen written to the magnetic disk devices for which the judgmentinstruction is issued by the disk control program 201 are not present inthe cache memory, de-staging is unnecessary and that, in cases wherethere are unwritten data in the cache memory, de-staging is required(802). Further, in cases where data which have not been written to themagnetic disk devices are not stored in the cache memory units 1074A and10748, the disk judgment program 603 sets the motor stoppage feasibility703 of the disk ON/OFF feasibility information 604 to feasible (804) andreports the judgment result to the disk control program 201. In caseswhere data which have not been written to the magnetic disk devices havebeen stored in the cache memory units 1074A and 10748, these data arewritten (de-staged) to the storage area of the magnetic disk devices(803). Thereafter, the motor stoppage feasibility 703 of the disk ON/OFFfeasibility information 604 is set to feasible (804) and the judgmentresult is reported to the disk control program 201.

The other embodiments will be described hereinbelow.

This embodiment illustrates a technology that is related to power supplycontrol for a storage device having two types of enclosure, namely abase enclosure and an additional enclosure. The base enclosure comprisesa control unit for controlling the whole storage device, magnetic diskdevices, a power supply, upper and lower I/O interfaces and theadditional enclosure which is intended as expansion, comprises magneticdisk devices, a power supply, and an I/O interface. The base enclosure,which possesses a control function, is able to provide a storagecapacity which meets the desires of the user by connecting a pluralityof additional enclosures which provide a storage area.

In the above embodiment, a technology that reduces power consumption byshifting magnetic disk devices that have not been accessed for a fixedperiod to a power saving mode. However, the above embodiment is limitedto curtailing the amount of power consumption of the magnetic diskdevices. There is room for further improvement in the case of a storagedevice that comprises a plurality of enclosures as per this embodiment.That is, there is scope for reducing the amount of power consumption notonly of the magnetic disk devices but also that of the cooling deviceand control unit within an enclosure. This embodiment describes atechnology relating to control of the supply of power to the storagedevice which makes it possible to reduce the power consumption stillfurther by performing power conservation control not only in magneticdisk device units but also in enclosure units while maintaining thefunction for transmitting signals between the enclosures of the storagedevice and judging the state of access to the storage areas of thestorage system.

FIG. 9 shows a configuration view of a second storage system of thepresent invention.

The storage device comprises a base enclosure 903 and an additionalenclosure 905 and comprises one base enclosure and one or moreadditional enclosures, for example.

The base enclosure 903 is connected via a network to a host computer 901and a management device 902, and is connected via a cable 906 toadditional enclosures 905 and performs control of the whole storagedevice also including the additional enclosures. Hence, the baseenclosure 903 holds system configuration information for the purpose ofmanaging the configuration of the whole storage device and the powersupply state.

The data which are stored in the storage areas of the magnetic diskdevices which the base enclosure 903 and additional enclosures 905comprise are afforded RAID-based redundancy. The storage device suppliesan upper device such as the host computer 901 with the storage area ofthe magnetic disk devices as a logical storage area (LU). The storageareas of the base enclosure 903 and additional enclosures 905 compriseone or more RAID groups. Further, a plurality of LU are included in aRAID group.

The management device 902 is used by the system administrator, forexample, and makes changes to the settings of the configurationinformation of the storage device such as changes to the RAID groupsettings or changes to the settings of the LU management informationwhich is stored in a memory 9040 of the base enclosure 903. The hostcomputer 901 issues commands to read and write data with respect to thestorage areas of the magnetic disk devices of the storage device via anetwork. The base enclosure 903 accesses magnetic disk devices 9056 and9036 of the base enclosure 903 or additional enclosures 905 inaccordance with a command that is issued by the host computer 901 andreads and writes data with respect to the storage areas of the magneticdisk devices 9056 and 9036.

The base enclosure 903 comprises a power supply unit 9031, a host I/F9032, an inter-enclosure connection controller 9033, a storage mediumpackage 9034, a power supply control circuit 9037, a control unit 9038,and a cache memory 9041. The control unit 9038 comprises a processor9039 and a memory 9040 and the memory 9040 stores the disk controlprogram 201 mentioned in the previous embodiment, the LU managementinformation 202 shown in FIG. 3, the RAID group management information(RG management information) 203 shown in FIG. 4, and the enclosuremanagement information 204 which will be described subsequently.

The storage medium package 9034 contains involatile storage media suchas a plurality of magnetic disk devices 9036, for example, and a FAN9035 or the like for cooling the involatile storage media. As per theprevious embodiment, the magnetic disk devices are able to assume threerotational states, namely, a normal mode, a power conservation mode, anda motor stoppage mode in response to an instruction from a higher leveldevice.

Power is supplied to the base enclosure 903 via the power supply unit9031. In this embodiment, a redundant configuration in which two of eachof the power supply unit 9031 are provided in each enclosure is shownbut there may also be any number of power supply units 9031. The powersupply control circuit 9037 receives an instruction from the controlunit 9038 and performs ON/OFF switching of the power supply to thestorage medium package 9034. The base enclosure 903 is connected to ahost computer 901 via a host I/F 9032 and a SAN network or other networkand the additional enclosures 905 are connected to the host computer 901via the base enclosure 903. The additional enclosures 905 comprise apower supply unit 9051, a power supply control circuit 9057, a storagemedium package 9054, and an inter-enclosure connection controller 9053.The additional enclosures 905 is connected to the base enclosure 903 viathe inter-enclosure connection controller 9053. The base enclosure 903and additional enclosures 905 are cascade-connected to one another. Thatis, the transmission of signals and data between the base enclosure 903and the additional enclosures 905 and between the additional enclosures905 is performed via a cable 906 in the order of the adjacentenclosures.

The storage area of the storage device comprises a plurality of RAIDgroups. Further, the reading and writing of data from the host computer901 to the storage device are carried out with the LU, which constitutesa portion of the storage area in the RAID group, as the target. Thestorage device comprises a plurality of RAID groups and a RAID groupcomprises a plurality of LU. In cases where read and write commands fromthe host computer to the storage device are issued, the commands arefirst received by the base enclosure 903.

The disk control program 201 of the base enclosure 903 judges, based onthe LU management information 202 and enclosure management information204, which storage area in the base enclosure 903 or the additionalenclosure 905 among the additional enclosures 905 the magnetic diskdevice comprising the storage area constituting the access target islocated in and, if the case of a storage area which is contained in amagnetic disk device of an additional enclosure, the disk controlprogram 201 transmits a command to the additional enclosure 905 via theinter-enclosure controllers 9033 and 9053.

The system area of the magnetic disk device which each enclosurecomprises stores disk judgment program 603 for judging whether themotors of the magnetic disk devices of the respective enclosures may bestopped and ON/OFF feasibility information 604 which holds the judgmentresult of the disk judgment program 603.

The disk control program 201 and disk judgment program 603 reference theLU management information 202 and RAID group management information 203in the event of an instruction from the management device 902 as per theabove embodiment and performs control to shift the rotational state ofthe magnetic disk device corresponding with the state of access withrespect to the storage device by the host computer 901 in accordancewith the flow in FIG. 5.

FIG. 10 shows enclosure management information 204 of the memory 9040 ofthe base enclosure and shows the power supply state for each enclosure.The enclosure number 1001 is the identification number of each enclosureand the enclosure state 1002 indicates the power supply state of eachenclosure. For example, in cases where the enclosure state 1002 is‘package ON’, this indicates that power is being supplied to the wholeenclosure including the storage medium package in the enclosure. Incases where the enclosure state 1002 is ‘package OFF’, this indicatesthat power is being supplied to the enclosure except for the packagepart. In cases where the enclosure state 1002 is ‘enclosure power OFF’,this indicates that power is not being supplied to the enclosure. Thedisk number 1003 indicates identification information on the magneticdisk devices of each enclosure. The enclosure management information 204is updated by the disk control program whenever the power supply stateis actually switched.

The disk control program advances the processing in accordance with theflow shown in FIG. 5 and the processing which is then carried out by thedisk control program 201 in cases where the rotation of the motors ofthe magnetic disk devices is stopped will be described hereinbelow byusing FIG. 11.

FIG. 11 shows the flow of power OFF processing in enclosure units by thedisk control program 201.

The processing shown in FIG. 11 is started given that the motors of themagnetic disk devices are stopped in RAID group units (521) as a resultof the processing of the disk control program 201 shown in FIG. 5.First, the disk control program 201 references the enclosure managementinformation 204 and specifies an enclosure in which magnetic diskdevices whose motors have stopped are contained. The disk controlprogram 201 then judges whether all of the motors of the magnetic diskdevices in the enclosure have entered the motor stoppage mode (1101). Incases where there is a magnetic disk device whose motor has not beenstopped in the enclosure, the processing of the disk control program 201is terminated. In cases where all of the motors of the magnetic diskdevices in the enclosure have stopped, the disk control program 201issues an instruction to the power supply control circuit to stop thesupply of power to the storage medium package 9034 of the enclosure(1102). The disk control program 201 also references the enclosuremanagement information 204 and judges whether the enclosure states 1002corresponding with the enclosures downstream of the enclosure for whichthe supply of power to the storage medium package 9034 has been stoppedare all ‘package OFF’ or ‘enclosure power OFF’ (1103). Further, in caseswhere the enclosure states of the enclosure management informationcorresponding with the enclosures downstream of the former enclosure areall ‘package OFF’ or ‘enclosure power OFF’, the supply of power to theseenclosures is stopped (1104). In other cases, the processing isterminated. An enclosure which is downstream of the enclosure for whichthe supply of power to the storage medium package has been stopped is anenclosure which transmits signals to the base enclosure via theinter-enclosure controller 9033 of the enclosure for which the supply ofpower to the storage medium package 9034 has been stopped.

The storage device of this embodiment has a configuration in which aplurality of enclosures are cascade-connected. Access to the storageareas of the respective enclosures by the host computer 901 is made viathe inter-enclosure controller 9033 of the enclosure upstream of all thebase enclosures 903 and the enclosure being accessed. Hence, power mustbe supplied to the enclosure upstream of the enclosure being accessed.However, it is not necessary to supply power to the storage mediumpackage 9034 unless all of the magnetic disk devices in the enclosureare accessed. Very precise power supply control which considers powerconservation effects is possible by stopping the supply of power to thestorage medium package 9034 while supplying power to parts other thanthe storage medium package 9034 of the enclosure by switching the powersupply control circuit 9037 of each enclosure.

FIG. 12 shows, by way of a conceptual view, the flow of all theprocessing of the power supply control of this embodiment.

A disk array device comprises a plurality of enclosures and eachenclosure comprises a plurality of RAID groups (RG). Further, a RAIDgroup (RG) comprises a plurality of LU. First, when the stateinformation for all of the LU in the RG is ‘power conservationinstruction’, the motors of the magnetic disk devices which constitutethe RG are stopped. Further, in cases where the motors of all the RG inthe enclosure are stopped, the transmission of power to the storagemedium package is OFF. In addition, if the power supply is OFF forenclosures which are downstream of an enclosure for which thetransmission of power to the storage medium package is OFF, the supplyof power to the former enclosure is OFF. Thus, power conservationcontrol which corresponds with the state of access by the host computeris implemented.

FIG. 13 is an example of a screen display for a device such as themanagement device 902. The system administrator or the like is able tograsp the power supply control state of the whole system at the currenttime from the screen in FIG. 13. Information relating to the powersupply control is transmitted by the disk control program 201 of thebase enclosure 903 to the management program 9021 which is executed bythe management device 902 and the management program is displayed on thescreen of the management device 902. Information may also be displayedon a computer other than the management device 902.

Enclosure 01 is an example of a screen display which schematically showsthe LU and the RAID groups of the base enclosure (1301). Enclosure 08 isan example of a screen display which schematically shows the LU and theRAID groups of the additional enclosure (1302). The storage area whichis indicated by the oblique lines on the screen display indicates thatthe motor of the magnetic disk device which corresponds with the storagearea has stopped. The power supply control of the storage device isimplemented by taking physical structures as units such as the magneticdisk device units and enclosure units or the like. In contrast, accessby the host computer is performed by taking the LU, which constitutelogical storage areas, as units. Thus, a visual representation in whichthe states of the power supply control performed in physical units andthe logical storage areas are mapped facilitates management by thesystem administrator. In addition, the RAID groups may be storage areasof magnetic disk devices which are in the same enclosure or may bestorage areas which extend across the storage areas of a plurality ofenclosures. The RAID configuration and power supply states can also begrasped visually in cases where a RAID group extends across the storageareas of a plurality of enclosures.

Power supply control history information 1303 associates and stores timeinformation for when the disk control program 201 has performed powersupply control up to the present, as well as information on enclosureswhich are to be control targets, RAID groups which are to be controltargets, and LU which are to be control targets.

The time 1305 of the power supply control status history informationindicates the times when power supply control is performed. The powersupply control target 1306 indicates the identification number of theenclosure having the storage area constituting the power supply controltarget, the RAID group identification number, and the LUN and so forth.The control content 1307 indicates the specific nature of the powersupply control that is actually performed. The power supply controlhistory information 1303 may also be automatically updated in the eventthat the disk control program 201 transmits this information to themanagement program when the power supply control by the disk controlprogram 201 is performed.

The power supply control information 1304 is an example of a screendisplay for indicating the current status of the power supply of thestorage area units selected by the system administrator or the like. Thenumber of RAID groups and LU of the storage device is huge and it isseemingly difficult to grasp the power supply states of the specifiedRAID groups and LU only visually. Hence, management by the systemadministrator is facilitated by showing the power supply states for themagnetic disk devices which correspond with the LUN, enclosureidentification number, and the storage area of the RAID groupidentification number which are selected on the screen by the systemadministrator. The storage media of the storage device were described asmagnetic disk devices in this embodiment. However, it is understood thatthe storage media may also be involatile storage media such as flashmemory or storage media of another type.

1. A storage system comprising: a first enclosure including a firstcontroller and a first storage medium package which includes a pluralityof first disk devices and a first fan; a second enclosure, which iscoupled to said first enclosure, including a second controller and asecond storage medium package which includes a plurality of second diskdevices and a second fan; and a third enclosure, which is coupled tosaid second enclosure, including a third controller and a third storagemedium package which includes a plurality of third disk devices and athird fan, wherein a state of each of said second disks is changed to amotor stopping mode when each of said second disks has not been accessedfor more than a predetermined time, wherein if the state of all of saidsecond disks is said motor stopping mode, said first controllerinstructs to stop power supply to said second storage medium package,and wherein if power supply to second storage medium package is stoppedand said power supply to said third enclosure or said third storagemedium package is stopped, said first controller controls to stop powersupply to said second enclosure.
 2. A storage system according to claim1, wherein said second enclosure is coupled to said first enclosure viasaid second controller, and said third enclosure is coupled to saidsecond enclosure via said second controller and said third controller.3. A storage system according to claim 2, further comprising: a firstcable between said first enclosure and said second enclosure; and asecond cable between said second enclosure and said third enclosure. 4.A storage system according to claim 3, wherein even when power supply tosaid second storage medium package is stopped, data is transferredbetween said second enclosure and said third enclosure via said secondcontroller and said second cable and said third controller.
 5. A storagesystem according to claim 1, wherein said first enclosure is a baseenclosure, and said second enclosure and said third enclosure areadditional enclosures.
 6. A storage system according to claim 1, whereinsaid motor stopping mode is a standby state where a motor in said seconddisk device is stopped while power is supplied to said second diskdevice.
 7. A storage system according to claim 1, wherein a state ofeach of said third disks is changed to said motor stopping mode wheneach of said third disks has not been accessed for more than apredetermined time, wherein if the state of all of said third disks issaid motor stopping mode, said first controller instructs to stop powersupply to said third storage medium package or controls to stop powersupply to said third enclosure.
 8. A storage system comprising: a firstenclosure including a first controller; a second enclosure, which iscoupled to said first enclosure, including a second controller and asecond storage medium package which includes a plurality of second diskdevices and a first fan; and a third enclosure, which is coupled to saidsecond enclosure, including a third controller and a third storagemedium package which includes a plurality of third disk devices and asecond fan, wherein a state of each of said second disks is changed tomotor stopping mode when each of said second disks has not been accessedfor more than a predetermined time, wherein if the state of all of saidsecond disks is said motor stopping mode, power supply to said secondstorage medium package is controlled to stop, and wherein if powersupply to second storage medium package is stopped and said power supplyto said third enclosure or said third storage medium package is stopped,power supply to said second enclosure is controlled to stop.
 9. Astorage system according to claim 8, wherein a state of each of saidthird disks is changed to said motor stopping mode when each of saidthird disks has not been accessed for more than a predetermined time,and wherein if the state of all of said third disks is said motorstopping mode, power supply to said third storage medium package orpower supply to said third enclosure is controlled to stop.
 10. Astorage system comprising: a first enclosure including a firstcontroller; a second enclosure, which is coupled to said firstenclosure, including a second controller and a second storage mediumpackage which includes a plurality of second disk devices and a firstfan; and a third enclosure, which is coupled to said second enclosure,including a third controller and a third storage medium package whichincludes a plurality of third disk devices and a second fan, wherein astate of each of said second disks is changed to power saving mode wheneach of said second disks has not been accessed for more than apredetermined time, wherein if the state of all of said second disks issaid power saving mode, power supply to said second storage mediumpackage is controlled to stop, and wherein if power supply to secondstorage medium package is stopped and said power supply to said thirdenclosure or said third storage medium package is stopped, power supplyto said second enclosure is controlled to stop.
 11. A storage systemaccording to claim 10, wherein a state of each of said third disks ischanged to said power saving mode when each of said third disks has notbeen accessed for more than a predetermined time, and wherein if thestate of all of said third disks is said power saving mode, power supplyto said third storage medium package or power supply to said thirdenclosure is controlled to stop.